 empirical covariance matrix




A EM-algorithm to fit LDFA-H (Section 2)

Neural Information Processing Systems

Since the MPLE objective function for LDFA-H given in Eq. (9) is not guaranteed to be convex, the EM-algorithm may find a local minimum depending on the choice of initial value; hence a good initialization is crucial for successful estimation. The initialization builds on the equivalence between CCA and probabilistic CCA shown by A. Anonymous, and the resulting Lasso problem is solved by the P-GLASSO algorithm of Mazumder et al. (2010). We simulated realistic data with known cross-region connectivity as follows. Notice that the amplitudes of the top four factors dominate the others.
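As a rough, hedged sketch of that initialization flow (not the authors' code): classical CCA can supply initial loadings via its equivalence with probabilistic CCA, and an off-the-shelf graphical lasso can stand in for the P-GLASSO solver to produce a sparse precision estimate. All array names, sizes, and the penalty below are illustrative.

import numpy as np
from sklearn.cross_decomposition import CCA
from sklearn.covariance import GraphicalLasso

rng = np.random.default_rng(0)
X = rng.standard_normal((200, 10))   # signals from region 1 (illustrative)
Y = rng.standard_normal((200, 8))    # signals from region 2 (illustrative)

# Step 1: classical CCA supplies candidate initial loadings, using its
# equivalence with probabilistic CCA.
cca = CCA(n_components=3).fit(X, Y)
Wx, Wy = cca.x_weights_, cca.y_weights_   # initial factor loadings

# Step 2: a sparse precision (inverse covariance) estimate of the joint data,
# with scikit-learn's GraphicalLasso standing in for the P-GLASSO solver.
Z = np.hstack([X, Y])
prec = GraphicalLasso(alpha=0.1).fit(Z).precision_
print(Wx.shape, prec.shape)   # (10, 3) (18, 18)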


Bayesian neural networks with interpretable priors from Mercer kernels

Alberts, Alex, Bilionis, Ilias

arXiv.org Machine Learning

Quantifying the uncertainty in the output of a neural network is essential for deployment in scientific or engineering applications where decisions must be made under limited or noisy data. Bayesian neural networks (BNNs) provide a framework for this purpose by constructing a Bayesian posterior distribution over the network parameters. However, the prior, which is of key importance in any Bayesian setting, is rarely meaningful for BNNs. This is because the complexity of the input-to-output map of a BNN makes it difficult to understand how certain distributions enforce any interpretable constraint on the output space. Gaussian processes (GPs), on the other hand, are often preferred in uncertainty quantification tasks due to their interpretability. The drawback is that GPs are limited to small datasets without advanced techniques, which often rely on the covariance kernel having a specific structure. To address these challenges, we introduce a new class of priors for BNNs, called Mercer priors, such that the resulting BNN has samples that approximate those of a specified GP. The method works by defining a prior directly over the network parameters from the Mercer representation of the covariance kernel, and does not rely on the network having a specific structure. In doing so, we can exploit the scalability of BNNs in a meaningful Bayesian way.
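A minimal numerical sketch of the Mercer (Karhunen-Loeve) representation that such priors build on, not the paper's prior construction: kernel eigenpairs are approximated on a grid (Nystrom) and a GP-like sample is drawn as a weighted sum of eigenfunctions with independent Gaussian coefficients. The kernel, grid, and truncation level below are arbitrary choices.

import numpy as np

def rbf_kernel(x, y, ell=0.2):
    # Squared-exponential kernel on 1-D inputs.
    return np.exp(-0.5 * (x[:, None] - y[None, :])**2 / ell**2)

# Quadrature grid on [0, 1] for a Nystrom approximation of the Mercer eigenpairs.
n = 200
x = np.linspace(0.0, 1.0, n)
w = 1.0 / n                                # uniform quadrature weight
K = rbf_kernel(x, x)

# Eigen-decomposition of the weighted Gram matrix approximates (lambda_i, phi_i).
evals, evecs = np.linalg.eigh(w * K)
evals, evecs = evals[::-1], evecs[:, ::-1]  # descending order
phi = evecs / np.sqrt(w)                    # eigenfunctions evaluated on the grid

# Truncated Karhunen-Loeve sample: f ~= sum_i sqrt(lambda_i) * xi_i * phi_i,
# with xi_i i.i.d. standard normal, a finite-dimensional stand-in for a GP draw.
m = 20
rng = np.random.default_rng(0)
xi = rng.standard_normal(m)
f = phi[:, :m] @ (np.sqrt(np.clip(evals[:m], 0.0, None)) * xi)
print(f.shape)   # (200,) sample path on the grid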


MMGP_supplementary_material

fabie

Neural Information Processing Systems

Details regarding the datasets are provided in Appendix A. Morphing strategies and dimensionality. Regarding the AirfRANS dataset, the reader is referred to [14]. Examples of input geometries are shown in Figure 6 together with the associated output pressure fields. The output scalars of the problem are obtained by post-processing the three-dimensional velocity. Examples of input geometries are shown in Figure 7. Figure 8: (Tensile2d) Illustration of Tutte's barycentric mapping used in the morphing stage. Notice that although these morphing techniques are called "mesh morphing" ... A zoom of the RBF morphing close to the airfoil for test sample 787 is illustrated in Figure 10.
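A minimal sketch of RBF-based mesh morphing of the kind illustrated in the figures: prescribed displacements at boundary control points are interpolated to all mesh nodes with scipy's RBFInterpolator. The point sets and displacements below are made up for illustration; the supplementary material's actual morphing pipeline (including the Tutte mapping) is more involved.

import numpy as np
from scipy.interpolate import RBFInterpolator

rng = np.random.default_rng(0)

# Control points on the geometry boundary and their prescribed displacements
# (both illustrative; in practice these come from the target geometry).
ctrl_pts = rng.uniform(-1.0, 1.0, size=(30, 2))
ctrl_disp = 0.05 * rng.standard_normal((30, 2))

# Mesh nodes to be morphed.
nodes = rng.uniform(-1.0, 1.0, size=(500, 2))

# RBF interpolation of the displacement field, then apply it to the nodes.
rbf = RBFInterpolator(ctrl_pts, ctrl_disp, kernel="thin_plate_spline")
morphed_nodes = nodes + rbf(nodes)
print(morphed_nodes.shape)   # (500, 2)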



A Miscellaneous Results and Supporting

Neural Information Processing Systems

A.1 Properties of Stable Distributions. We will use the following property of stable distributions: Lemma A.1. By integrating the tail bound from the previous result, we get the following simple corollary: Corollary A.2. For fixed ...
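The step of turning a tail bound into a moment bound by integration follows a standard layer-cake pattern; as a hedged illustration only (the constants C, t_0 and the exponent alpha are placeholders, not the paper's statement), a tail bound P(|X| > t) <= C t^{-alpha} for t >= t_0 gives, for 0 < p < alpha,

\[
\mathbb{E}\,|X|^{p}
  = p \int_{0}^{\infty} t^{p-1}\,\mathbb{P}\bigl(|X| > t\bigr)\,dt
  \le t_{0}^{p} + p\,C \int_{t_{0}}^{\infty} t^{p-1-\alpha}\,dt
  = t_{0}^{p} + \frac{p\,C\,t_{0}^{\,p-\alpha}}{\alpha - p}.
\]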


CoVariance Filters and Neural Networks over Hilbert Spaces

Battiloro, Claudio, Cavallo, Andrea, Isufi, Elvin

arXiv.org Artificial Intelligence

CoVariance Neural Networks (VNNs) perform graph convolutions on the empirical covariance matrix of signals defined over finite-dimensional Hilbert spaces, motivated by robustness and transferability properties. Yet, little is known about how these arguments extend to infinite-dimensional Hilbert spaces. In this work, we take a first step by introducing a novel convolutional learning framework for signals defined over infinite-dimensional Hilbert spaces, centered on the (empirical) covariance operator. We constructively define Hilbert coVariance Filters (HVFs) and design Hilbert coVariance Networks (HVNs) as stacks of HVF filterbanks with nonlinear activations. We propose a principled discretization procedure, and we prove that empirical HVFs can recover the Functional PCA (FPCA) of the filtered signals. We then describe the versatility of our framework with examples ranging from multivariate real-valued functions to reproducing kernel Hilbert spaces. Finally, we validate HVNs on both synthetic and real-world time-series classification tasks, showing robust performance compared to MLP and FPCA-based classifiers.
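A finite-dimensional sketch of the covariance filtering that such filters reduce to after discretization: a polynomial in the empirical covariance matrix applied to each signal, followed by a pointwise nonlinearity, as in a single VNN/HVN-style layer. The filter taps and data below are arbitrary; this is not the authors' implementation.

import numpy as np

rng = np.random.default_rng(0)
X = rng.standard_normal((300, 16))            # 300 signals over 16 dimensions
C = np.cov(X, rowvar=False)                   # empirical covariance matrix

def covariance_filter(x, C, taps):
    # h(C) x = sum_k taps[k] * C^k x, via repeated matrix-vector products.
    out = np.zeros_like(x)
    power = x.copy()
    for h_k in taps:
        out += h_k * power
        power = C @ power
    return out

taps = [0.5, 0.3, 0.2]                        # illustrative filter coefficients
Y = np.array([covariance_filter(x, C, taps) for x in X])
Z = np.tanh(Y)                                # nonlinearity: one "HVN-style" layer
print(Z.shape)                                # (300, 16)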



The noise level in linear regression with dependent data

Ziemann, Ingvar, Tu, Stephen, Pappas, George J., Matni, Nikolai

arXiv.org Machine Learning

Ordinary least squares (OLS) regression from a finite sample is one of the most ubiquitous and widely used techniques in machine learning. When faced with independent data, there are now sharp tools available to analyze its success optimally under relatively general assumptions. Indeed, a non-asymptotic theory matching the classical asymptotically optimal understanding from statistics [van der Vaart, 2000] has been developed over the last decade [Hsu et al., 2012, Oliveira, 2016, Mourtada, 2022]. However, once we relax the independence assumption and move toward data that exhibits correlations, the situation is much less well-understood--even for a problem as seemingly simple as linear regression. While sharp asymptotics are available through various limit theorems, there are no general results matching these in the finite sample regime. In this paper, we study the instance-specific performance of ordinary least squares in a setting with dependent data--and in contrast to much contemporary work on the theme--without imposing realizability.
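As a concrete, hedged illustration of the setting only (not the paper's analysis): OLS fit to covariates generated by an autoregressive process, so successive samples are dependent, with a response that is not exactly realizable by any linear predictor. All parameters below are made up.

import numpy as np

rng = np.random.default_rng(0)
T, d = 500, 3

# Dependent covariates: a stable AR(1) process, so the rows of X are correlated in time.
A = 0.8 * np.eye(d)
X = np.zeros((T, d))
for t in range(1, T):
    X[t] = A @ X[t - 1] + rng.standard_normal(d)

# Non-realizable response: a mildly nonlinear target plus noise, so no linear
# predictor fits it exactly (misspecified / agnostic setting).
beta_star = np.array([1.0, -0.5, 0.25])
y = X @ beta_star + 0.1 * np.sin(X[:, 0]) + 0.5 * rng.standard_normal(T)

# Ordinary least squares from the finite sample.
beta_hat, *_ = np.linalg.lstsq(X, y, rcond=None)
print(beta_hat)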